Add support for multi-run graphing in ilab / Crucible backend #120

dbutenhof · 2024-10-14T13:01:07Z

Type of change

Description

This draft PR is stacked on PR #117 and adds support for overlaying metric graphs across multiple runs. It will be rebased onto main and opened for review after #117 is merged.

Related Tickets & Documents

Primarily PANDA-600.

Checklist before requesting a review

I have performed a self-review of my code.
If it is a core feature, I have added thorough tests.

Testing

InstructLab CPT uses a persistent Crucible controller system in RDU3, tied to a 4-way L40S test system. The data store (a private OpenSearch instance) holds a set of Crucible runs covering both training and SDG.

`GET localhost:8000/api/v1/ilab/runs?benchmark=ilab` queries the ilab.crucible OpenSearch instance and returns a list of ilab benchmark runs.
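For anyone testing this by hand, here is a minimal client sketch for the run-list query. It assumes the backend is listening on `localhost:8000` over plain HTTP; the helper names `runs_url` and `fetch_runs` are made up for illustration, and the response is returned undecoded beyond JSON parsing because its shape isn't described here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the CPT backend is running locally on port 8000.
BASE_URL = "http://localhost:8000"


def runs_url(base_url: str = BASE_URL, **filters: str) -> str:
    """Build the run-list URL, adding any filters as query parameters."""
    url = f"{base_url}/api/v1/ilab/runs"
    return f"{url}?{urlencode(filters)}" if filters else url


def fetch_runs(**filters: str):
    """GET the run list and parse the JSON body.

    The structure of the parsed response is not assumed here;
    it is returned exactly as the server sent it.
    """
    with urlopen(runs_url(**filters)) as response:
        return json.load(response)
```

Calling `fetch_runs(benchmark="ilab")` against a running backend issues the same request as the `GET` above.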

Add Crucible readme file. Cleanups and refactoring

Also added the option to override the default graph title generator using the new `Graph.title` field.
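The override could look roughly like the sketch below. Only the `Graph.title` field name comes from this PR; the other fields, the `default_title` generator, and its output format are hypothetical stand-ins, not the actual backend code.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Graph:
    """Hypothetical, simplified stand-in for the API's `Graph` element."""

    run: str
    metric: str
    title: Optional[str] = None  # when set, replaces the generated title


def default_title(graph: Graph) -> str:
    """Hypothetical default generator: metric name plus a shortened run ID."""
    return f"{graph.metric} (run {graph.run[:8]})"


def graph_title(graph: Graph) -> str:
    """Prefer the caller-supplied title; otherwise fall back to the generator."""
    return graph.title if graph.title is not None else default_title(graph)
```

Checking `is not None` rather than truthiness is a deliberate choice in this sketch: it lets a caller pass an empty string to suppress the title entirely.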

This replaces my direct API call for fetching a run's periods for graphing with a separate action and a reducer. I also experimented with improving error diagnosis by surfacing details from some of the error responses in the "toast" instead of just saying something went wrong.

Add a Crucible `close` method, and use a FastAPI yield dependency to ensure every API connection is closed cleanly.
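The pattern is sketched below using only the standard library. `CrucibleService`, its constructor argument, and the `is_open` flag are hypothetical placeholders for the real service class; the generator has the shape FastAPI expects from a yield dependency, and the route wiring appears only as a comment so the sketch runs without FastAPI installed.

```python
from typing import Iterator


class CrucibleService:
    """Hypothetical stand-in for the Crucible service wrapper."""

    def __init__(self, config_path: str) -> None:
        self.config_path = config_path
        # The real class would open its OpenSearch client here.
        self.is_open = True

    def close(self) -> None:
        """Release the underlying connection; safe to call more than once."""
        self.is_open = False


def crucible_svc() -> Iterator[CrucibleService]:
    """Yield dependency: hand the service to the route, then always close it."""
    crucible = CrucibleService("ilab.crucible")
    try:
        yield crucible
    finally:
        # Runs after the route returns and also when it raises.
        crucible.close()


# Wiring into a route would look roughly like this (requires FastAPI):
#
#   from typing import Annotated
#   from fastapi import APIRouter, Depends
#
#   router = APIRouter()
#
#   @router.get("/api/v1/ilab/runs")
#   async def runs(crucible: Annotated[CrucibleService, Depends(crucible_svc)]):
#       ...
```

The `try`/`finally` around the `yield` is what makes the cleanup unconditional: FastAPI resumes the generator once the request is finished, including when the route raised an exception.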

+ other review feedback

+ add some method documentation + misc review feedback

Multigraph API failed if more than one `Graph` element specified the same run; fix to be smarter about missing run IDs. This also contains experimental code to expose per-iteration param values, which doesn't quite work but doesn't seem to hurt anything.
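One plausible reading of the fix, as a sketch rather than the actual backend code: collect each distinct run once even when several `Graph` elements name it, and fill in a missing run ID from a default when one is available. The `default_run` parameter, the `resolve_runs` helper, and the simplified `Graph` fields are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Graph:
    """Hypothetical, simplified stand-in for the API's `Graph` element."""

    metric: str
    run: Optional[str] = None  # may be omitted when a default run is given


def resolve_runs(graphs: List[Graph], default_run: Optional[str] = None) -> List[str]:
    """Fill in missing run IDs and return each distinct run once, in first-seen order."""
    runs: List[str] = []
    for graph in graphs:
        if graph.run is None:
            if default_run is None:
                raise ValueError(
                    f"graph for metric {graph.metric!r} has no run ID and no default was given"
                )
            graph.run = default_run
        # Several graphs may share a run; record it only the first time.
        if graph.run not in runs:
            runs.append(graph.run)
    return runs
```

With this shape, two graphs that reference the same run produce a single entry in the run list instead of a failure.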

(And `/api/v1/ilab/runs` reports iterations in numerical order.)
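As a small illustration of why numerical ordering matters, the sketch below sorts iteration records by integer value rather than as strings. The record layout and the `"iteration"` key are hypothetical; the real response fields may differ.

```python
from typing import Dict, List


def sort_iterations(iterations: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Order iterations by the integer value of their (hypothetical) "iteration" field."""
    return sorted(iterations, key=lambda item: int(item["iteration"]))


unsorted = [{"iteration": "10"}, {"iteration": "2"}, {"iteration": "1"}]
ordered = [item["iteration"] for item in sort_iterations(unsorted)]
```

A plain string sort of the same values would place `"10"` before `"2"`, which is the ordering the numeric key avoids.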

Move the legend up off the graph (although 1.5 is arbitrary and maybe not ideal -- is there a more dynamic way to do this?). Also, I fixed the name of a `.less` class earlier, but just happened to notice the corresponding use...

Move "unique parameters" accordion back up with parameters using the new expansion toggles.

dbutenhof · 2024-10-25T11:48:34Z

Subsumed by the new #125 and #127

dbutenhof and others added 29 commits September 27, 2024 16:30

Add ILAB / Crucible support to CPT backend

4094cc9

GET localhost:8000/api/v1/ilab/runs?benchmark=ilab will query the ilab.crucible OpenSearch instance and return a list of ilab benchmark runs.

UI code updates

e6c8ee0

Improve periodic graph names.

5e735a7

Add Crucible readme file. Cleanups and refactoring

Documentation and cleanup

505903b

Also added the option to override the default graph title generator using the new `Graph.title` field.

Allow overriding graph color

4ccd603

Some (self) review cleanup

de0accc

Cleanup OpenSearch connections

5671d77

Add a Crucible `close` method, and use a FastAPI yield dependency to ensure every API connection is closed cleanly.

Try to remove a couple of incidental changes

a49fc65

Undoing a few more ancillary changes

1d5783d

Review feedback

f20e45d

Pagination and Date filter issue

a959fe6

Rewrite param consolidation

79151ea

+ other review feedback

Add framework for UI multi-run comparison

8b75d5f

Some UI cleanup

c8cd597

Debug unhandled exceptions

3b0b190

+ add some method documentation + misc review feedback

Fix multigraph bug

b32a606

Multigraph API failed if more than one `Graph` element specified the same run; fix to be smarter about missing run IDs. This also contains experimental code to expose per-iteration param values, which doesn't quite work but doesn't seem to hurt anything.

Support for per-iteration parameters.

2148311

(And `/api/v1/ilab/runs` reports iterations in numerical order.)

comparison

312eebf

render graph

0d7592a

Pagination for Graphs

e056d62

Support relative timescale graphs

659d820

pagination data

e25b82e

Multi run comparison adjustments

993a4fe

A few tweaks

de38738

Move the legend up off the graph (although 1.5 is arbitrary and maybe not ideal -- is there a more dynamic way to do this?). Also, I fixed the name of a `.less` class earlier, but just happened to notice the corresponding use...

closing of graph accordion

25fe58f

Adjustments

eed488f

Move "unique parameters" accordion back up with parameters using the new expansion toggles.

conflict resolve

606f468

multiple APIs to fetch periods and icon to display more info

e755221

dbutenhof closed this Oct 25, 2024

dbutenhof deleted the multirun branch October 25, 2024 11:48
